LOCUS AE008898 23880 bp DNA linear BCT 31-JUL-2002

DEFINITION Salmonella typhimurium LT2, section 202 of 220 of the complete
genome.
ACCESSION AE008898 AE006468
VERSION AE008898.1
KEYWORDS .
SOURCE Salmonella typhimurium LT2
ORGANISM Salmonella typhimurium LT2
Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
Enterobacteriaceae; Salmonella.

REFERENCE 1 (bases 1 to 23880)
AUTHORS McClelland,M., Sanderson,K.E., Spieth,J., Clifton,S.W.,
Latreille,P., Courtney,L., Porwollik,S., Ali,J., Dante,M., Du,F.,
Hou,S., Layman,D., Leonard,S., Nguyen,C., Scott,K., Holmes,A.,
Grewal,N., Mulvaney,E., Ryan,E., Sun,H., Florea,L., Miller,W.,
Stoneking,T., Nhan,M., Waterston,R. and Wilson,R.K.
TITLE Complete genome sequence of Salmonella enterica serovar Typhimurium
LT2
JOURNAL Nature 413 (6858), 852-856 (2001)
MEDLINE 21534948
PUBMED 11677609
REFERENCE 2 (bases 1 to 23880)
AUTHORS The Salmonella typhimurium Genome Sequencing Project.
TITLE Direct Submission
JOURNAL Submitted (29-MAR-2001) Genome Sequencing Center, Department of
Genetics, Washington University School of Medicine, 4444 Forest
Park Boulevard, St. Louis, MO 63108, USA
COMMENT Supported by NIH grant 5U 01 AI43283

Coding sequences below are predicted from manually evaluated 
computer analysis, using similarity information and the programs; 
GLIMMER; http://www.tigr.org/softlab/glimmer/glimmer.html and 
GeneMark; http://opal.biology.gatech.edu/GeneMark/

EC numbers were kindly provided by Junko Yabuzaki and the Kyoto 
Encyclopedia of Genes and Genomes; http://www.genome.ad.jp/kegg/, 
and Pedro Romero and Peter Karp at EcoCyc; 
http://ecocyc.PangeaSystems.com/ecocyc/

The analyses of ribosome binding sites and promoter binding sites 
were kindly provided by Heladia Salgado, Julio Collado-Vides and
RegulonDB;
http://kinich.cifn.unam.mx:8850/db/regulondb_intro.frameset

This sequence was finished as follows unless otherwise noted: all 
regions were double stranded, sequenced with an alternate
chemistry or covered by high quality data (i.e., phred quality >=
30); an attempt was made to resolve all sequencing problems, such
as compressions and repeats; all regions were covered by sequence
from more than one m13 subclone.
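
The finishing criterion above is stated as a phred quality of at least 30. As a minimal sketch, assuming the conventional phred definition Q = -10 * log10(P), where P is the estimated probability that a base call is wrong, the threshold can be converted to an error probability as follows. The function names are hypothetical and are not part of this record or of any particular base-calling tool.

```python
import math


def phred_to_error_probability(quality):
    """Convert a phred quality score to the estimated base-call error probability."""
    return 10 ** (-quality / 10)


def error_probability_to_phred(probability):
    """Convert an estimated base-call error probability to a phred quality score."""
    return -10 * math.log10(probability)


if __name__ == "__main__":
    # Under this definition, the threshold of 30 named in the comment
    # corresponds to roughly one expected miscall per 1000 bases.
    print(phred_to_error_probability(30))
```

Under that assumption, each ten-point increase in quality corresponds to a tenfold lower expected error rate.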
FEATURES Location/Qualifiers
source 1..23880
/organism="Salmonella typhimurium LT2"
/strain="LT2; SGSC 1412; ATCC 700720"
/db_xref="ATCC:700720"
/db_xref="taxon:99287"
/note="LT2"
gene 107..1036
/gene="malM"
/note="synonym: STM4232"
RBS 107..112
/gene="malM"
/note="putative RBS for malM; RegulonDB:STMS1H004137"
CDS 119..1036
/gene="malM"
/note="maltose operon periplasmic protein precursor.
(SW:MALM_SALTY)"
/codon_start=1
/transl_table=11
/product="periplasmic protein of mal regulon"
/protein_id="AAL23056.1"
/db_xref="GI:16422790"
/translation="MKMKKSLVALCLTAGLFASVPGISLAEVNYVPQNTSAAPVIPAA
ALQQLTWTPVDQSKTQSTQLATGGQRLDVAGITGPVAAYSVPANIGELTLTLTSEVNK
QASVFAPNVLILDQNMTPSAFFPSSYFTYQQPGVMSADRLEGVMRLTPALGQQKLYVL
VFTTEKDLQQTTTLLDPAKAYAKGVGNSIPDIPDPVARHTTDGVVKLKVKTNSSSSVL
VGPLFGSSGTGPVTVGNTAAPVAAPAPVAPKKSEPMLNDTESYFNKAIKDAVAKGDVD
KALKLLDEAERLGSTSARSTFISSVKGKG"
gene 1203..1714
/gene="ubiC"
/note="synonym: STM4233"
RBS 1203..1208
/gene="ubiC"
/note="putative RBS for ubiC; RegulonDB:STMS1H004138"
CDS 1217..1714
/gene="ubiC"
/EC_number="4.-.-.-"
/note="similar to E. coli chorismate lyase (AAC77009.1);
Blastp hit to AAC77009.1 (202 aa), 86% identity in aa 38 -
202"
/codon_start=1
/transl_table=11
/product="chorismate pyruvate lyase"
/protein_id="AAL23057.1"
/db_xref="GI:16422791"
/translation="MSHPALTQLRALRYFDAIPALEPHLLDWLLLEDSMTKRFEQQGK
RVSVTLIREAFVGQSEVEEASGLLPSESRYWLREILLCADGEPWLAGRTVVPESTLCG
PEQVLQHLGKTPLGRYLFTSSTLTRDFIEIGRDATLWGRRSRLRLSGKPLLLTELFLP
ASPLY"

gene 1715..2600
/gene="ubiA"
/note="synonym: STM4234"
RBS 1715..1720
/gene="ubiA"
/note="putative RBS for ubiA; RegulonDB:STMS1H004139"
CDS 1728..2600
/gene="ubiA"
/EC_number="2.5.1.-"
/note="similar to E. coli
4-hydroxybenzoate-octaprenyltransferase (AAC77010.1);
Blastp hit to AAC77010.1 (290 aa), 93% identity in aa 1 -
290"
/codon_start=1
/transl_table=11
/product="p-hydroxybenzoate:octaprenyltransferase"
/protein_id="AAL23058.1"
/db_xref="GI:16422792"
/translation="MEWSLTQSKLLAFHRLMRTDKPIGALLLLWPTLWALWVATPGMP
QLWILAVFVAGVWLMRAAGCVVNDYADRKFDGHVKRTVNRPLPSGAVTEKEARNLFVV
LVLLAFLLVLTLNAMTILLSVAALALAWVYPFMKRYTHLPQVVLGAAFGWSIPMAFAA
VSESLPLSCWLMFLANILWAVAYDTQYAMVDRDDDIKIGIKSTAILFGRYDTLIIGIL
QLGVMALMALIGWLNGLGWGYYWAVLVAGALFVYQQKLIANREREACFKAFMNNNYVG
LVLFLGLAMSYWHF"
gene complement(2699..5136)
/gene="plsB"
/note="synonym: STM4235"
CDS complement(2699..5119)
/gene="plsB"
/EC_number="2.3.1.15"
/note="similar to E. coli glycerol-3-phosphate
acyltransferase (AAC77011.1); Blastp hit to AAC77011.1
(827 aa), 94% identity in aa 21 - 826"
/codon_start=1
/transl_table=11
/product="glycerolphosphate acyltransferase activity"
/protein_id="AAL23059.1"
/db_xref="GI:16422793"
/translation="MSGWPRIYYKLLNLPLSILVKSKSIPAEPAQELGLDTSRPIMYV
LPYNSKADLLTLRAQCLAHDLPDPLEPLEIDGALLPRYVFIHGGPRVFTYYTPKEESV
KLFHDYLDLHRSNPALDVQMVPVSVMFGRAPGREKGEDNPPLRMLNGVQKFFAISWLG
RDSFVRFSPSVSLRRMADEHGTDKIIAQKLARVARMHFARQRLAAVGPRLPARQDLFN
KLLASKAIARAVEDEARSKKISHEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWN
RLYQGINVHNAERVRQLAHDGHEIVYVPCHRSHMDYLLLSYVLYHQGLVPPHIAAGIN
LNFWPAGPIFRRLGAFFIRRTFKGNKLYSTVFREYLGELFSRGYSVEYFVEGGRSRTG
RLLDPKTGTLSMTIQAMLRGGTRPITLVPIYIGYEHVMEVGTYAKELRGATKEKESLP
QMLKGLSKLRNLGQGYVNFGEPMPLMTYLNQHVPEWRESIDPIEAIRPAWLTPTVNSI
AADLMVRINNAGAANAMNLCCTALLASRQRSLTREQLTEQLDCYLDLMRNVPYSTDST
VPAASAGELIAHALQMNKFEVEKDTIGDIIILPREQAVLMTYYRNNIAHMLIMPSLMA
AIITQHRRISRDALQQHVEALYPMLKAELFLRWEREELASVIDALASEMQRQGLITLQ
DDELHINPTHSRTLQLLAAGARETLQRYAITFWLLSANPSINRSTLEKESRTVAQRLS
VLHGINAPEFFDKAVFSSLVLTLRDEGYISDTGDAEPAETMKIYQMLADLITSDVRLT
IESATQGE"
RBS complement(5131..5136)
/gene="plsB"
/note="putative RBS for plsB; RegulonDB:STMS1H004140"
gene 5278..5658
/gene="dgkA"
/note="synonym: STM4236"
RBS 5278..5283
/gene="dgkA"
/note="putative RBS for dgkA; RegulonDB:STMS1H004141"
CDS 5290..5658
/gene="dgkA"
/EC_number="2.7.1.107"
/note="similar to E. coli diacylglycerol kinase
(AAC77012.1); Blastp hit to AAC77012.1 (122 aa), 86%
identity in aa 1 - 121"
/codon_start=1
/transl_table=11
/product="diacylglycerol kinase"
/protein_id="AAL23060.1"
/db_xref="GI:16422794"
/translation="MANNTTGFTRIIKAAGYSWKGFRAAWTHEAAFRQESLAVLLGVI
IACWLDVDAITRVLLIGSVLLIMIVEILNSAIEAVVDRIGSEYHELSGRAKDMGSAAV
LLSIFVALMTWGILLWSHFR"
gene 5701..6375
/gene="lexA"
/note="synonym: STM4237"
-35_signal 5701..5709
/gene="lexA"
/note="putative -35_signal for lexA;
RegulonDB:STMLTH004657"
protein_bind 5722..5738
/gene="lexA"
/note="putative binding site for LexA, RegulonDB:
STMS1H000241"
/bound_moiety="LexA"
-10_signal 5725..5733
/gene="lexA"
/note="putative -10_signal for lexA;
RegulonDB:STMLTH004657"
protein_bind 5742..5762
/gene="lexA"
/note="putative binding site for LexA, RegulonDB:
STMS1H000249"
/bound_moiety="LexA"
RBS 5756..5761
/gene="lexA"
/note="putative RBS for lexA; RegulonDB:STMS1H004142"
CDS 5767..6375
/gene="lexA"
/EC_number="3.4.21.88"
/note="transcriptional repressor (LexA family); LEXA
repressor. (SW:LEXA_SALTY)"
/codon_start=1
/transl_table=11
/product="SOS response regulator"
/protein_id="AAL23061.1"
/db_xref="GI:16422795"
/translation="MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAE
EHLKALARKGVLEIVSGASRGIRLLQEEEDGLPLVGRVAAGEPLLAQQHIEGHYQVDP
SLFKPSADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARIDDEVTVKRLKK
QGNKVELLPENSEFTPIVVDLREQSFTIEGLAVGVIRNGEWL"
gene 6540..7878
/gene="dinF"
/note="synonym: STM4238"
RBS 6540..6545
/gene="dinF"
/note="putative RBS for dinF; RegulonDB:STMS1H004143"
CDS 6553..7878
/gene="dinF"
/note="induced by UV and mitomycin C; SOS, lexA regulon;
similar to E. coli DNA-damage-inducible protein F
(AAC77014.1); Blastp hit to AAC77014.1 (459 aa), 89%
identity in aa 19 - 457"
/codon_start=1
/transl_table=11
/product="DNA-damage-inducible protein F"
/protein_id="AAL23062.1"
/db_xref="GI:16422796"
/translation="MPLFTSSDKALWRLALPMIFSNITVPLLGLVDTAVIGHLDSPVF
LGGVAVGATATSFLFMLLLFLRMSTTGLTAQAFGAKNPQALARALIQPLLLALGAGVM
IVLFRTPLIELALHIVGGNDAVLVQARRFLEIRWLSAPASLANLVLLGWLLGVQYARA
PVILLVVGNILNIALDLWLVMGLHMNVQGAALATVIAEYVTLLIGLMMVRKVLHLRGV
SLDMLKQAWRGNVRRLLALNRDIMLRSLLLQLCFGAITVSGARLGSDIIAVNAVLMTL
LTFTAYALDGFAYAVEAHSGQAYGARDGSKLLDVWRAACRQSGIVALLFSTVYALAGE
HIVALLTSLPQIQLLADRYLIWQVVLPLVGVWCYLLDGMFIGATRAAEMRNSMAVAAG
GFALTLFALPVLGNHGLWLALTVFLALRGLSLSLIWRRHWREGTWFARS"

gene 7875..7988
/gene="STM4239"
CDS 7875..7988
/gene="STM4239"
/note="hypothetical protein"
/codon_start=1
/transl_table=11
/product="putative cytoplasmic protein"
/protein_id="AAL23063.1"
/db_xref="GI:16422797"
/translation="MTVKNSEYKTEGRIPAYNYYHVQQKLQSIWANNTCYS"
gene 7998..8219
/gene="yjbJ"
/note="synonym: STM4240"
RBS 7998..8003
/gene="yjbJ"
/note="putative RBS for yjbJ; RegulonDB:STMS1H004144"
CDS 8007..8219
/gene="yjbJ"
/note="similar to E. coli orf, hypothetical protein
(AAC77015.1); Blastp hit to AAC77015.1 (69 aa), 95%
identity in aa 1 - 69"
/codon_start=1
/transl_table=11
/product="putative cytoplasmic protein"
/protein_id="AAL23064.1"
/db_xref="GI:16422798"
/translation="MMNKDEAGGNWKQFKGKMKEQWGKLTDDDMTVIEGKRDQLVGKI
QERYGYQKDQAEKEVVDWETRNNYRW"
gene complement(8318..8843)
/gene="zur"
/note="synonym: STM4241"
CDS complement(8318..8833)
/gene="zur"
/note="Fur family; similar to E. coli putative regulator
(AAC77016.1); Blastp hit to AAC77016.1 (191 aa), 92%
identity in aa 21 - 191"
/codon_start=1
/transl_table=11
/product="transcriptional repressor of znuABC operon"
/protein_id="AAL23065.1"
/db_xref="GI:16422799"
/translation="MEKTTTQELLAQAEKLCAQRNVRLTPQRLEVLRLMSLQQGAISA
YDLLDLLRETEPQAKPPTIYRALDFLLEQGFVHKVESTNSYVVCHLFDQPTHSSAMFI
CDRCGVVKEECAEGVEDIMHTLAAKMGFALRHNVIEAHGLCPACVEVEACRHPGNCGH
DHSVLVKKKPR"
RBS complement(8838..8843)
/gene="zur"
/note="putative RBS for zur; RegulonDB:STMS1H004145"
gene 9076..10390
/gene="STM4242"
RBS 9076..9081
/gene="STM4242"
/note="putative RBS for STM4242; RegulonDB:STMS1H004146"
CDS 9080..10390
/gene="STM4242"
/codon_start=1
/transl_table=11
/product="putative outer membrane or exported protein"
/protein_id="AAL23066.1"
/db_xref="GI:16422800"
/translation="MNKNVKLSLIAIAVSLFMAKQASAANTWTEARNDAMGGTGVASA
NYGSGVLLNPALLAKAKPEDNITVVLPAVGVQITDKDNLQDEIDDISDKVDYYDEVVD
NLTLGQILLNPRGVLNQFQGAARDLADELEYLNGKTARANAGAGLAVSIPGQTLSVAF
IAKGYAHGRVSSSIDQNDIQYLRDIQHDERVALREAGRAALLGSDEITKHLNSTASGR
VAIVSDYGIALAKQFVVGEVPVSIGVTPKLQKTWLYNYTTSIYNYDSSDWNSSRYRND
DTGFNIDAGLAADIGENWTLGVSGQNLVSRDIDTKDIYITNGMTGETTNYKDTYQIRP
LVTAGIAWHNDLLTVSADGDLTETKGFKSEDNSQYVGVGAEVRPLSWLAVRAGYRADV
KNNDSNVVTGGLGFAPFNRVHLDLMGLYGEDETWGAGAQLTMTF"
gene 10478..11476
/gene="yjbN"
/note="synonym: STM4243"
CDS 10478..11476
/gene="yjbN"
/note="possibly dehydrogenases; nifR3 family; similar to
E. coli orf, hypothetical protein (AAC77019.1); Blastp hit
to AAC77019.1 (345 aa), 90% identity in aa 7 - 345"
/codon_start=1
/transl_table=11
/product="putative TIM-barrel enzyme"
/protein_id="AAL23067.1"
/db_xref="GI:16422801"
/translation="MQPETQSSALPAYRFSIAPMLDWTDRHCRYFLRLLSRQTQLYTE
MVTTGAIIHGKGDYLAYSEEEHPVALQLGGSDPAQLAHCAKLAEARGYDEINLNVGCP
SDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDDQDSYAFLCDFIDT
VSGQGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGIKS
LEEAKEHLRHMDGVMVGREAYQNPGILAAVDREIFGADTTDADPVAVVRAMYPYIERE
LSQGAYLGHITRHMLGLFQGIPGARQWRRYLSENAHKAGADVAVLEQALKLVADKR"
gene 11631..11886
/gene="yjbO"
/note="synonym: STM4244"
RBS 11631..11636
/gene="yjbO"
/note="putative RBS for yjbO; RegulonDB:STMS1H004147"
CDS 11644..11886
/gene="yjbO"
/note="similar to E. coli orf, hypothetical protein
(AAC77020.1); Blastp hit to AAC77020.1 (150 aa), 82%
identity in aa 71 - 150"
/codon_start=1
/transl_table=11
/product="putative inner membrane protein"
/protein_id="AAL23068.1"
/db_xref="GI:16422802"
/translation="MLELLFVLGFFLMLMVTGVSLLGILAALVVATAVMFLGGMFALM
IKLLPWLLLAVAVVWVIKAVKTPKIPQYQRNNRRFY"
gene complement(12061..13052)
/gene="qor"
/note="synonym: STM4245"
CDS complement(12061..13044)
/gene="qor"
/EC_number="1.6.5.5"
/note="similar to E. coli quinone oxidoreductase
(AAC77021.1); Blastp hit to AAC77021.1 (327 aa), 90%
identity in aa 1 - 327"
/codon_start=1
/transl_table=11
/product="NADPH dependent quinone oxidoreductase"
/protein_id="AAL23069.1"
/db_xref="GI:16422803"
/translation="MATRIEFHKHGGPEVLQTVEFTPAEPAEHEIQVENKAIGINFID
TYIRSGLYPPPSLPAGLGTEAAGVVSKVGNGVEHIRVGDRVVYAQSTLGAYSSVHNVT
ADKAAILPDAISFEQAAASFLKGLTVFYLLRKTYEVKPDEPFLFHAAAGGVGLIACQW
AKALGAKLIGTVGSAQKAQRALDAGAWQVINYREESIVERVKEITGGKKVRVVYDSVG
KDTWEASLDCLQRRGLMVSFGNASGPVTGVNLGILNQKGSLYATRPSLQGYITTREEL
TEASNELFSLIASGVIKVDVAENQRYALKDARRAHEVLESRATQGSSLLIP"
RBS complement(13047..13052)
/gene="qor"
/note="putative RBS for qor; RegulonDB:STMS1H004148"
gene 13091..14524
/gene="dnaB"
/note="synonym: STM4246"
RBS 13091..13096
/gene="dnaB"
/note="putative RBS for dnaB; RegulonDB:STMS1H004149"
CDS 13109..14524
/gene="dnaB"
/EC_number="3.6.1.-"
/note="chromosome replication; chain elongation;
replicative DNA helicase. (SW:DNAB_SALTY)"
/codon_start=1
/transl_table=11
/product="putative replicative DNA helicase"
/protein_id="AAL23070.1"
/db_xref="GI:16422804"
/translation="MAGNKPFNKPQTDARDRDPQVAGIKVPPHSIEAEQSVLGGLMLD
NERWDDVAERVVAEDFYTRPHRHIFTEMGRLQESGSPIDLITLAESLERQGQLDSVGG
FAYLAELSKNTPSAANISAYADIVRERAVVRDMIAVAHEIADAGYDPQGRNSDELLDL
AESRVFQIAENRANKDEGPKSIDQILDATVARIEQLFQQPHDGVTGVDTGYQDLNKKT
AGLQRSDLIIVAARPSMGKTTFAMNLCENAAMLQDKPVLIFSLEMPGEQIMMRMLASL
SRVDQTRIRTGQLDDEDWARISGTMGILLEKRNMYIDDSSGLTPTEVRSRARRIFREH
GGLSLIMIDYLQLMRVPSLSDNRTLEIAEISRSLKALAKELQVPVVALSQLNRSLEQR
ADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE"
gene 14545..15635
/gene="alr"
/note="synonym: STM4247"
RBS 14545..14550
/gene="alr"
/note="putative RBS for alr; RegulonDB:STMS1H004150"
CDS 14556..15635
/gene="alr"
/EC_number="5.1.1.1"
/note="alanine racemase, biosynthetic. (SW:ALR1_SALTY)"
/codon_start=1
/transl_table=11
/product="biosynthetic alanine racemase 1"
/protein_id="AAL23071.1"
/db_xref="GI:16422805"
/translation="MQAATVVINRRALRHNLQRLRELAPASKLVAVVKANAYGHGLLE
TARTLPDADAFGVARLEEALRLRAGGITQPILLLEGFFDAADLPTISAQCLHTAVHNQ
EQLAALEAVELAEPVTVWMKLDTGMHRLGVRPEEAEAFYQRLTHCKNVRQPVNIVSHF
ARADEPECGATEHQLDIFNAFCQGKPGQRSIAASGGILLWPQSHFDWARPGIILYGVS
PLEHKPWGPDFGFQPVMSLTSSLIAVRDHKAGEPVGYGGTWVSERDTRLGVVAMGYGD
GYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPNAQDNAGDPVVLWGEGLPVERI
AEMTKVSAYELITRLTSRVAMKYID"
gene 15773..17014
/gene="tyrB"
/note="synonym: STM4248"
-35_signal 15773..15781
/gene="tyrB"
/note="putative -35_signal for tyrB;
RegulonDB:STMLTH004658"
protein_bind 15794..15816
/gene="tyrB"
/note="putative binding site for TyrR, RegulonDB:
STMS1H000389"
/bound_moiety="TyrR"
-10_signal 15800..15808
/gene="tyrB"
/note="putative -10_signal for tyrB;
RegulonDB:STMLTH004658"
RBS 15806..15811
/gene="tyrB"
/note="putative RBS for tyrB; RegulonDB:STMS1H004151"
protein_bind 15820..15844
/gene="tyrB"
/note="putative binding site for TyrR, RegulonDB:
STMS1H000384"
/bound_moiety="TyrR"
CDS 15821..17014
/gene="tyrB"
/EC_number="2.6.1.57"
/note="tyrosine repressible; aromatic-amino-acid
aminotransferase. (SW:TYRB_SALTY)"
/codon_start=1
/transl_table=11
/product="tyrosine aminotransferase"
/protein_id="AAL23072.1"
/db_xref="GI:16422806"
/translation="MFQKVDAYAGDPILSLMERFKDDSRHDKVNLSIGLYYNEDGIIP
QLKTVAEAEARLNAQPHGASLYLPMEGLNTYRHTIAPLLFGADHPVLQQQRVATIQTL
GGSGALKVGADFLKRYFPDAGVWVSDPTWENHIAIFAGAGFEVSTYPWYDDATNGIRF
NDLLATLNTLPARSIVLLHPCCHNPTGADLTPSQWDAVIEIVKARDLIPFLDIAYQGF
GAGMDDDAYVIRAIASAGLPALVSNSFSKIFSLYGERVGGLSVVCEDAEIAARVLGQL
KATVRRIYSSPPCFGAQVVATVLGDEALKAGWLAEVDAMRNRIISMRQTLVKELKAEM
PDRNFDYLLQQRGMFSYTGLSEEQVDRLRDEFGVYLIASGRMCVAGLNASNVHRVAKA
FAAVM"

gene 17190..17914
/gene="aphA"
/note="synonym: STM4249"
RBS 17190..17195
/gene="aphA"
/note="putative RBS for aphA; RegulonDB:STMS1H004152"
CDS 17201..17914
/gene="aphA"
/EC_number="3.1.3.2"
/note="similar to E. coli diadenosine tetraphosphatase
(AAC77025.1); Blastp hit to AAC77025.1 (237 aa), 89%
identity in aa 1 - 237"
/codon_start=1
/transl_table=11
/product="non-specific acid
phosphatase/phosphotransferase, class B"
/protein_id="AAL23073.1"
/db_xref="GI:16422807"
/translation="MKKITLALSAVCLLFTLNHSANALVSSPSTLNPGTNVAKLAEQA
PVHWVSVAQIENSLTGRPPMAVGFDIDDTVLFSSPGFWRGKKTYSPDSDDYLKNPAFW
EKMNNGWDEFSIPKEVARQLIDMHVRRGDSIYFVTGRSQTKTETVSKTLADNFHIPAA
NMNPVIFAGDKPEQNTKVQWLQEKNMRIFYGDSDNDITAARDCGIRGIRILRAANSTY
KPLPQAGAFGEEVIVNSEY"
gene 18031..18459
/gene="yjbQ"
/note="synonym: STM4250"
RBS 18031..18036
/gene="yjbQ"
/note="putative RBS for yjbQ; RegulonDB:STMS1H004153"
CDS 18043..18459
/gene="yjbQ"
/note="similar to E. coli orf, hypothetical protein
(AAC77026.1); Blastp hit to AAC77026.1 (138 aa), 78%
identity in aa 1 - 138"
/codon_start=1
/transl_table=11
/product="putative cytoplasmic protein"
/protein_id="AAL23074.1"
/db_xref="GI:16422808"
/translation="MWYQRTITLSEKPRGFHLITDEITDKLSGLPPVETGLLHLLLLH
TSASLTLNENCDPTVRADMERHFLKTVPDNAAYEHDYEGADDMPSHIKSSVLGVSLLL
PVRQGRLQLGTWQGIWLGEHRIHGGPRKIIATLQGE"
gene 18449..18818
/gene="yjbR"
/note="synonym: STM4251"
RBS 18449..18454
/gene="yjbR"
/note="putative RBS for yjbR; RegulonDB:STMS1H004154"
CDS 18462..18818
/gene="yjbR"
/note="similar to E. coli orf, hypothetical protein
(AAC77027.1); Blastp hit to AAC77027.1 (118 aa), 92%
identity in aa 1 - 117"
/codon_start=1
/transl_table=11
/product="putative cytoplasmic protein"
/protein_id="AAL23075.1"
/db_xref="GI:16422809"
/translation="MTISELLQYCMAKPGAEQSVHSDWKATQIKVEDVLFAMVKEVEG
RPAASLKTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQLYYLVDASYQQ
AVNALSEDKRKQLPQS"
gene complement(18819..19163)
/gene="STM4252"
CDS complement(18819..19157)
/gene="STM4252"
/codon_start=1
/transl_table=11
/product="putative inner membrane protein"
/protein_id="AAL23076.1"
/db_xref="GI:16422810"
/translation="MASINEIHYLITTAQAEHPVASSAIAEFIQTYKQAREDSDDAIR
ESAAFIARALQEHARGWLDDDDMIILLEGQRDLARLRANNAQIALGSRIRSTVIRLID
IALALLVGAL"
gene complement(19144..19602)
/gene="STM4253"
CDS complement(19144..19599)
/gene="STM4253"
/codon_start=1
/transl_table=11
/product="putative outer membrane lipoprotein"
/protein_id="AAL23077.1"
/db_xref="GI:16422811"
/translation="MENTMKLPYAITLLLCLFLSACTLPDRFSAVAFQQLTLLQARST
RFLQDAARIPWQKETLLKDDRDIRQIFFQAERVARQGGDKHRLDNLALLKNHYLRLYA
RVMRRKQPLTYIQAERYQQQNNQVWKLAIQGECLHWGARCTQGDENGVY"
RBS complement(19158..19163)
/gene="STM4252"
/note="putative RBS for STM4252; RegulonDB:STMS1H004155"
RBS complement(19597..19602)
/gene="STM4253"
/note="putative RBS for STM4253; RegulonDB:STMS1H004156"
gene complement(19731..22719)
/gene="uvrA"
/note="synonym: STM4254"
CDS complement(19731..22556)
/gene="uvrA"
/note="complexes with UvrBC; excinuclease ABC subunit A.
(SW:UVRA_SALTY)"
/codon_start=1
/transl_table=11
/product="DNA excision repair enzyme"
/protein_id="AAL23078.1"
/db_xref="GI:16422812"
/translation="MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDT
LYAEGQRRYVESLSAVARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGTIT
EIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKG
EHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVIDRFKVRNDLSQRLA
ESFETALELSGGTAVVADMDDEKAEELLFSANFACPICGYSMRELEPRLFSFNNPAGA
CPTCDGLGVQQYFDPDRVIQNPDLSLAGGAIRGWDRRNFYYFQMLKSLAEHYKFDVDA
PWASLSANVHKVVLYGSGKENIEFKYMNDRGDTSVRRHPFEGVLHNMERRYKETESSA
VREELAKFISNRPCASCEGTRLNREARHVFVENTPLPAISDMSIGHAMDFFTNLKLSG
QRAKIAEKVLKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQIGAGLVGV
MYVLDEPSIGLHQRDNERLLGTLIHLRNLGNTVIVVEHDEDAIRAADHVIDIGPGAGV
HGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKQRVPANPEKVLKLTGARGNNLK
DVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHF
DKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPESRSRGYTPGRFSFNVRGGRC
EACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEARE
FFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYIL
DEPTTGLHFADIQQLLDVLHQLRDQGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGE
ILVAGTPETVAECEASHTARFLKPMLK"

gene 22521..22637
/gene="STM4255"
CDS 22521..22637
/gene="STM4255"
/codon_start=1
/transl_table=11
/product="putative cytoplasmic protein"
/protein_id="AAL23079.1"
/db_xref="GI:16422813"
/translation="MGAGAPNFDLIHSPFPVETRMPGLFEGQTAETANYDTI"
RBS complement(22558..22563)
/gene="uvrA"
/note="putative RBS for uvrA; RegulonDB:STMS1H004157"
protein_bind 22638..22658
/note="putative binding site for LexA, RegulonDB:
STMS1H000246"
/bound_moiety="LexA"
protein_bind complement(22638..22656)
/gene="uvrA"
/note="putative binding site for LexA, RegulonDB:
STMS1H000244"
/bound_moiety="LexA"
-10_signal complement(22695..22703)
/gene="uvrA"
/note="putative -10_signal for uvrA;
RegulonDB:STMLTH004659"
gene 22704..23334
/gene="ssb"
/note="synonym: STM4256"
-35_signal 22704..22712
/gene="ssb"
/note="putative -35_signal for ssb;
RegulonDB:STMLTH004660"
-35_signal complement(22711..22719)
/gene="uvrA"
/note="putative -35_signal for uvrA;
RegulonDB:STMLTH004659"
-10_signal 22731..22739
/gene="ssb"
/note="putative -10_signal for ssb;
RegulonDB:STMLTH004660"
RBS 22790..22795
/gene="ssb"
/note="putative RBS for ssb; RegulonDB:STMS1H004158"
CDS 22804..23334
/gene="ssb"
/note="controls activity of RecBCD nuclease; single-strand
binding protein (SSB) (helix-destabilizing protein).
(SW:SSB_SALTY)"
/codon_start=1
/transl_table=11
/product="ssDNA-binding protein"
/protein_id="AAL23080.1"
/db_xref="GI:16422814"
/translation="MASRGVNKVILVGNLGQDPEVRYMPSGGAVANLTLATSESWRDK
QTGEMKEQTEWHRVVMFGKLAEVAGEYLRKGSQVYIEGQLRTRKWTDQSGQERYTTEI
NVPQIGGVMQMLGGRQGGGAPAGGQQQGGWGQPQQPQQPQGGNQFSGGAQSRPQQSAP
APSNEPPMDFDDDIPF"
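
Each CDS in the feature table is given as a location, optionally wrapped in complement(...), together with /codon_start=1 and /transl_table=11. The following is a minimal sketch of how such an entry might be turned back into a protein string. The helper names (parse_location, reverse_complement, translate) are hypothetical, only simple start..end locations are handled, and reading the first codon as Met is an assumption that suits complete bacterial CDSs such as the ones listed here. Translation table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits.

```python
import re
from itertools import product

# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simple locations only: "119..1036" or "complement(2699..5119)".
LOCATION = re.compile(r"^(complement\()?(\d+)\.\.(\d+)\)?$")


def parse_location(text):
    """Return (start, end, strand) for a simple feature location (1-based, inclusive)."""
    match = LOCATION.match(text.replace(" ", ""))
    if match is None:
        raise ValueError(f"unsupported location: {text!r}")
    strand = "-" if match.group(1) else "+"
    return int(match.group(2)), int(match.group(3)), strand


def reverse_complement(seq):
    """Reverse-complement a DNA string, as needed for complement(...) features."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is written as M,
    which covers alternative bacterial starts such as GTG.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if i == 0 else amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    start, end, strand = parse_location("complement(2699..5119)")
    print(start, end, strand, end - start + 1)   # span length in bases
    # Bases 119..151 of the ORIGIN section, the start of the malM CDS.
    print(translate("atgaaaatgaagaaaagtctcgtcgccctctgt"))  # MKMKKSLVALC
```

For a complement feature, the slice taken from the ORIGIN sequence would be passed through reverse_complement before translate.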

BASE COUNT 5356 a 6155 c 6630 g 5739 t 
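
The BASE COUNT line gives a quick consistency check against the sequence length and yields the G+C content directly. The snippet below is a small sketch; the function names are hypothetical, and the input is the BASE COUNT line exactly as it appears in this record.

```python
import re


def parse_base_count(line):
    """Extract per-base counts from a GenBank BASE COUNT line."""
    pairs = re.findall(r"(\d+)\s+([acgt])\b", line.lower())
    return {base: int(count) for count, base in pairs}


def gc_fraction(counts):
    """Fraction of G and C bases among all counted bases."""
    total = sum(counts.values())
    return (counts["g"] + counts["c"]) / total


if __name__ == "__main__":
    counts = parse_base_count("BASE COUNT 5356 a 6155 c 6630 g 5739 t")
    print(sum(counts.values()))           # 23880, the length in the LOCUS line
    print(round(gc_fraction(counts), 3))  # 0.535
```

The four counts sum to 23880, the length stated on the first line of the record.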

ORIGIN 

1 catcaggtct gatgatgctt gtaagccgaa taaggcgcca gccgccattc ggcgatatgt 

61 ttgggttggc gcgctattgc ctggctaccg ctgaacccgc cttattgagg tgagaataat 

121 gaaaatgaag aaaagtctcg tcgccctctg tttaaccgca gggctgttcg ccagcgtgcc 

181 gggtatcagc ctggcggaag tgaactatgt accgcaaaat accagcgccg cgcccgttat 

241 tcctgctgct gcgctacagc aattaacctg gacgcctgta gatcagtcaa aaacgcagtc 

301 tacccaactg gcgactggcg gccagcgcct tgacgtcgcc ggcattaccg gcccggtcgc 

361 cgcttatagc gttccggcga atatcggcga gctgacatta actctgacca gcgaggtcaa 

421 taagcaagcc agcgtttttg cgccaaacgt gttgattctt gaccaaaata tgacgccttc 

481 ggcgttcttc cccagcagtt actttactta tcagcagccg ggcgtcatga gcgcagaccg 

541 gctggaaggc gtgatgcgcc tgacgcctgc gttggggcag cagaaactct atgttctggt 

601 gtttacgacc gaaaaagatc ttcagcaaac cactacgctg ctcgatccgg caaaagccta 

661 tgccaaaggc gtcggtaact ctatcccgga tattcccgat ccagttgctc gccataccac 

721 tgacggcgtg gtgaaactga aggtgaaaac caacagcagc tccagcgtat tggtcgggcc 

781 gttgtttggc tcgtccggta ccgggccggt aacggtaggc aataccgccg cgccggtcgc 

841 tgcgccagcg ccagtcgcgc cgaagaaaag cgagccgatg ttgaacgata cggaaagcta 

901 ctttaataaa gcgattaagg atgccgtagc gaaaggcgac gtcgataaag cgctgaaact 

961 gcttgatgag gccgaacgtc tgggatcgac atctgcccgt tccaccttta tcagcagtgt 

1021 aaaaggcaag gggtaattat ttccccacag tgctgatttt gcagcaactg gtgcgtctcc 

1081 tggcgcacct ttttttatca tttcatgcga attgttattt ttttgttgcg caacagatca 

1141 cttaattgtg attgcgcccc ccgtaatgtc ttttctgaga tacaatgact ttaggttatg 

1201 aatcggagag taaggcatgt cacaccctgc gttaacgcaa ctgcgtgcgc tgcgctattt 

1261 tgatgcgatt cctgcgcttg aaccccattt gctggactgg ctgttgctgg aagattctat 

1321 gactaaacgt tttgagcagc agggaaagcg ggtaagcgtg acgctgattc gggaagcgtt 

1381 tgtcggccaa agtgaggtcg aggaagcatc aggtttactg ccatccgagt cccgctactg 

1441 gttacgtgag atactgcttt gcgctgacgg cgagccctgg cttgccggac gtaccgtggt 

1501 accggaatcc acgctgtgcg gacctgaaca ggtcttacaa catctgggta aaaccccgct 

1561 tggacgttat ctatttacat catcgacact gacccgcgat tttattgaaa ttggacgcga 

1621 tgcaacgctg tgggggcgcc gctctcgact gcggttgagc ggaaaaccgc tgttgcttac 

1681 cgagcttttt ttacctgcat cgccgctgta ctgagaggaa gataaagatg gagtggagtc 

1741 tgacgcagag taagctgctg gcgtttcacc gtttgatgcg tacagataag cctattggcg 

1801 cgttgctgct gctctggccg acgctttggg cgctatgggt ggcgacgccg ggtatgcctc 

1861 agctgtggat tttggcagtt tttgtcgctg gcgtgtggtt aatgcgcgcg gcgggctgcg 

1921 tggtaaatga ttatgccgat cgtaaatttg acggacatgt aaaacgtacg gtcaaccggc 

1981 ctttacccag cggcgccgtg acagagaaag aagcgcgcaa cctgtttgtg gtgctggtgc 

2041 tgctggcgtt tctgttggtg ctgacgctga acgccatgac gattctgctt tccgtggcgg 

2101 cgctggcgtt ggcctgggtt tatcccttta tgaaacgcta cacccacctg ccgcaggtcg 

2161 tgctgggggc cgctttcggc tggtcaatcc ccatggcgtt cgccgcggtg agcgaatccc 

2221 tgccgttgag ctgctggctg atgttcctcg ccaatattct ctgggcagtg gcttacgata 

2281 cgcaatatgc gatggtcgac agagatgatg atatcaagat aggcattaaa tcgacggcga 

2341 tcttgtttgg tcgttatgac acattgatta tcgggattct gcagcttggc gtcatggcgt 

2401 taatggcgct gatcggctgg ttgaatggtc tgggatgggg atattactgg gctgtgctgg 

2461 tggcgggcgc gctgtttgtc taccagcaaa aactgattgc gaaccgtgag cgtgaagcct 

2521 gctttaaagc ctttatgaac aataactacg ttggcctggt gctattttta gggctggcga 

2581 tgagttactg gcacttctga tgtcatgccg ggcggacccg gcctacaaag ccaaaatttt 

2641 caatacgtca tatcagttgc ccgataagct tacgccatcg ggcatttcag gcactggctt 

2701 actcgccctg cgtcgcgctt tcaatcgtca gacggacatc agaggtaatc agatcggcaa 

2761 gcatctgata tatcttcatc gtttctgccg gttcggcatc gcccgtatca ctgatataac 

2821 cctcatcccg tagcgtcaac acgagtgaac tgaacaccgc tttgtcgaaa aactccggcg 

2881 cgttaatacc gtgcaggaca gacagacgtt gcgccacggt acggctttct ttctccagcg

2941 tactgcggtt gatggacgga ttggcgctca acagccagaa cgtaatggca taacgctgta 

3001 gcgtttcgcg cgcgcctgcc gccagcagtt gcagcgtacg ggaatgcgtc gggttgatat 

3061 gcagttcatc atcctgcagg gtaattagcc cctgacgctg catttcactg gccagcgcgt 

3121 caatgaccga ggctaactcc tcgcgctccc agcgcaggaa cagctccgct tttaacatcg 

3181 gatagagcgc ttcaacatgc tgctgcaacg catcacgcga aatgcgacga tgctgggtga 

3241 taatcgccgc catcagcgaa ggcataatca acatatgcgc gatattgtta cgatagtagg 

3301 tcatcaacac cgcctgctcg cgcggcagaa tgatgatatc gccaatggtg tctttctcga 

3361 cctcaaactt attcatctgc agcgcgtgag cgataagctc gccagcgctg gcggcaggaa 

3421 cggttgagtc tgtggaataa gggacattgc gcatcagatc gagatagcaa tcaagctgct 

3481 cggtgagctg ctcgcgggtg agcgagcgct gacgcgacgc taacagcgcg gtacaacaca 

3541 ggttcatcgc gtttgccgcg ccagcgttgt taatacgcac catcagatcg gcggcgatac 

3601 tatttaccgt cggcgtcagc caggctgggc gtatggcctc tatagggtcg atagattcgc 

3661 gccactccgg cacatgctgg ttcaggtacg tcatgagcgg catcggttcg ccaaagttaa 

3721 cgtagccctg acccagattg cgcagcttgc tcaagccttt cagcatctgc ggcaggctct 

3781 ctttctcttt tgtcgcgccg cgcagttctt tcgcataggt gcccacttcc atgacgtgct 
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3841 catagccgat gtaaataggc actaacgtaa 

3901 ggatagtcat cgacagggtg ccggttttcg 

3961 ctccttcgac aaaatactca accgagtaac 

4021 gaaagacggt ggaatagagt ttattcccct 

4081 gacgacggaa aatcgggcca gcgggccaga 

4141 gcggaaccag tccctggtga tacagcacat 

4201 ggtgacaggg tacatagacg atttcatgcc 

4261 cgttatgcac gttaattccc tggtaaagtc 

4321 tcaaacgaat catctcgtaa gagaagttcg 

4381 tttgctgcgc tttttcatga gagatttttt 

4441 cgatcgcttt cgaggccagc agtttgttaa 

4501 caaccgccgc caggcgctgg cgagcaaagt 

4561 cgatgatttt atcggtgccg tgttcatcag 

4621 atcggacaaa gctatcgcga cccagccagg 

4681 gcatacgtaa aggcgggtta tcttctccct 

4741 cggaaacggg caccatctgc acgtccaaag 

4801 cgtgaaaaag tttcaccgac tcttcttttg 

4861 cgtgaataaa tacatagcgt ggcaacagcg 

4921 ccgggagatc gtgcgccaga cattgcgccc 

4981 atggcaagac atacataatg gggcgagaag 

5041 ccggaataga cttgcttttt accagtatgc 

5101 ttcgtggcca gccggacata aacgatgtaa 

5161 aaggatatca gaaagtttag cgaatttctg 

5221 cggcagtccg aatcgtgcag ggcaaccgta 

5281 gttcttttga tggccaataa taccactgga 

5341 tcctggaaag gatttcgcgc cgcatggact 

5401 gcggtattgc tgggcgtcat catcgcctgc 

5461 ttattgatag gttccgtttt acttatcatg 

5521 gcggtcgtgg accgcatcgg ttctgaatac 

5581 ggatcggctg cggtgctact gtctattttc 

5641 tggtcacatt ttcgataaag gttccaaatt 

5701 tggttccaaa atcaccttta actgtatata 

5761 ggcggaatga aagcgttaac ggccaggcaa 

5821 atcagccaga caggtatgcc accgacgcgt 

5881 tccccaaacg cggcggaaga acaccttaaa 

5941 gtttccggcg catcgcgagg tatccgtctg 

6001 gtcgggcgtg tcgcggcggg tgaaccgctt 

6061 caggtcgatc cttcgctgtt caaacccagc 

6121 tcgatgaaag acatcggtat tatggatggg 

6181 gtccgtaatg gccaggtggt tgtcgctcgc 

6241 aaaaagcagg gcaataaagt ggagctgctg 

6301 gtggatctgc gcgaacaaag ctttacgatt 

6361 ggggaatggt tgtagtctct ttttaatctc 

6421 cctgatggcg ctgcgcttat caggcctacg 

6481 aggccggata aggcgttacg tcgcatccgg 

6541 aggcttttct ccatgccgct ttttacctcc 

6601 ccgatgattt tttctaacat caccgtcccc 

6661 ggtcatctgg acagccctgt tttcctgggc 

6721 tttcttttca tgctgctgct atttctacgt 

6781 tttggcgcca aaaatccgca ggcgttagcc 

6841 ctgggcgcag gcgtgatgat tgtgctattt 

6901 attgtcggcg ggaacgacgc tgtcctggtg 

6961 ctaagcgctc ccgcgtcact ggcgaatctg 

7021 tatgcgcggg cgccggttat cttgttggtc 

7081 ctgtggctgg tgatggggct tcatatgaac 

7141 gcagaatatg tcacgttact gatcggcctc 

7201 ggtgtgtcgc tggatatgct aaaacaggcc 

7261 ctcaatcgcg atatcatgct gcgctcattg 

7321 gtgtcaggcg cgcggctggg cagcgatatt 

7381 ctcaccttta cggcctatgc gctggacggt 

7441 caggcttacg gcgcgcgtga cggtagcaag 

7501 cagtcaggaa ttgtggcctt gttgttttcc 

7561 gtcgcgttac tgacctcatt gccgcaaata 

7621 caggtggtat taccgctggt gggcgtatgg 

7681 gcgacgcgcg ctgccgaaat gcgcaatagc 

7741 acgctcttcg ccctgccggt attaggaaat 

7801 ctggcgcttc gcggtctgtc gttgagcctt 

7861 tggtttgcca gatcgtgacg gttaaaaatt 

7921 cctacaatta ttatcacgtc cagcagaaat 

7981 attcctaacc gaatgatgag gacaagatga 

8041 aacagtttaa aggtaaaatg aaagaacaat 



ttggacgggt accgccgcgc agcatcgcct 
ggtctaacaa acgtccggta cgcgaacgtc 
cgcggctgaa cagttcgccc aggtattcgc 
taaaggtgcg acgaataaag aacgcgccca 
aattaaggtt aataccggcg gcaatgtgcg 
aagaaaggag caagtagtcc atatggctgc 
cgtcgtgcgc cagttggcgt acgcgctcgg 
ggttccaggt aaaccccaaa atcctgtcgg 
ccgcgatctc ttccatcagc gcaatggcgt 
tactgcgcgc ttcgtcttct accgcgcggg 
acagatcctg acgggcaggc aggcgtgggc 
gcatacgcgc cacgcgagcc agcttctgcg 
ccatacgacg caatgagacg gacggcgaga 
agatggcaaa gaatttctgt acgccgttca 
tttcacgacc cggcgcgcgg ccaaacatca 
ccggattact ccgatgcaga tcgagataat 
gcgtgtaata ggtaaagaca cgcggtccgc 
cgccgtcgat ctccagcggc tccagcggat 
gcagcgtcag taagtccgct tttgagttgt 
tatcgagccc caattcctgg gcgggttccg 
ttaatggtaa attcagtaat ttgtagtaaa 
agcctctggt taataatgca aatgcggtgc 
ttgtatcccg tcatacttcg cgctgcgtta 
taatgtcatt gacgttaata atgtaaaaag 
ttcaccagaa ttatcaaagc ggcaggatat 
cacgaagcgg cgtttcgcca ggaaagcctc 
tggctggatg tggatgcgat tacccgtgtg 
attgttgaaa tcctcaacag cgctattgaa 
catgagcttt ccggccgtgc gaaagatatg 
gttgccctga tgacctgggg catcctgtta 
ttataaatct cttgtttttt gcgccgtttg 
ctcacagcat gattgtatat acacccaggg 
caagaggtgt ttgatctcat ccgggatcac 
gcggaaatcg cgcagcgttt ggggttccgt 
gcgctggcgc gcaaaggggt gcttgaaatc 
ttacaggaag aagaggacgg attaccgctt 
ctggcgcagc agcatattga aggccattac 
gccgatttcc tgctgcgcgt gagcggtatg 
gatttgctgg cggtacataa aacgcaggac 
attgatgatg aagtgacagt aaaacgcctg 
ccggaaaaca gcgagtttac accgatagtg 
gaagggctgg cggtaggcgt cattcgcaac 
cttgtaagcc gccatccggc aatcgtgtag 
ggaatgcagt tcctgagatg attaatttgt 
caatgcgctc gtttcgttta actccttgtc 
tctgataaag ctctctggcg tcttgcgttg 
ttactggggc tggtcgatac ggcggtgatt 
ggcgtggcgg tgggcgcgac tgccaccagc 
atgagcacca ccggcctaac ggcgcaggca 
cgggcgctca ttcaaccgtt attgctggcg 
cgcacgccgc ttatcgagct ggctttacat 
caggcgagac gcttccttga aattcgttgg 
gtgctgctcg gttggctgtt aggcgttcag 
gtgggcaata tcctcaatat cgcgctcgac 
gtgcagggtg ctgccctggc gacggtcatc 
atgatggtac gtaaagttct ccatctccgc 
tggcgcggca acgtgcgccg ccttttggcg 
ctccttcagc tttgttttgg cgcaatcacg 
atcgcggtaa atgcagtcct gatgacctta 
tttgcttatg cggtggaggc gcactccggt 
ctactggacg tctggcgagc ggcgtgtcgt 
acagtctatg ccctggcagg cgaacatatt 
cagctactgg cggatcggta tcttatctgg 
tgctatctgc tcgacggtat gtttattggc 
atggcagtcg ccgcgggagg attcgcgctg 
catggtttat ggctggcatt aaccgtgttc 
atctggcgtc gtcactggcg cgagggtaca 
ctgaatataa aaccgaaggc cgaattcctg 
tgcagagcat ttgggcgaat aacacctgct 
tgaataaaga cgaagccggc ggtaactgga 
ggggcaaact gaccgatgac gatatgaccg 
8101 ttatcgaagg taaacgcgac caactggtag gtaaaatcca ggagcgttac ggttaccaga

8161 aagatcaggc ggaaaaagag gttgttgact gggaaacccg taacaactat cgctggtaat 

8221 atgttttccc ctaccaggag tacatggatg tgcttccccc tgtttgctta tgaatgcggg 

8281 cgtgcccctc tctactgccg cccgttttcc gttcaggcta acgtggtttt ttcttcacca 

8341 gcaccgagtg atcgtgacca cagtttcccg gatggcgaca cgcttctact tctacgcacg 

8401 cagggcacag accgtgcgcc tcaataacgt tatgtcgtag cgcaaagccc attttagccg 

8461 ccagcgtatg catgatgtct tccacgccct cggcacactc ttccttcacc acgccgcaac 

8521 gatcgcagat aaacatggcc gagctgtgcg tcggctggtc gaacaggtgg caaaccacat 

8581 aactattggt ggattccact ttatggacaa aaccctgttc gagcaaaaaa tccagcgcgc 

8641 ggtaaatggt gggcggtttg gcctgtggtt ccgtttcgcg cagcaaatcg agcaaatcgt 

8701 atgcgctgat ggcaccctgt tgcagactca tcaggcgcag cacttcgagg cgctgaggcg 

8761 tcaggcgcac gttgcgctgc gcgcagagtt tttcagcttg cgccagtaac tcttgcgttg 

8821 tggtcttttc catgtagcac ctcgaatggc atgaaagtct aatccgttac tttaccacga 

8881 gctgcttaag actctgaaat taccccgcga caatatatcg cctgctaagc ttttccgcgc 

8941 cagtattctc atctatacat aatgagggtc gatattataa caacaggatt taacatcact 

9001 atttcacagc aatagttcgc ctctgtccgg taaagcgtgg tggaggaatt acttcacctg 

9061 ataataagat gggctaactg tgaacaaaaa cgtcaaactt tcactgattg ctatcgcggt 

9121 ctcgcttttt atggcaaagc aggcgagcgc cgccaatacc tggacggaag cgcgtaatga 

9181 cgcaatgggc gggacgggcg ttgcgtccgc gaattatggc agcggagtac tattaaatcc 

9241 ggcgttactg gcaaaagcca aaccggaaga caatatcact gttgtccttc ctgcggtcgg 

9301 cgtccagatt acggataaag acaatctcca ggatgaaatt gacgatatca gcgataaagt 

9361 ggattactac gatgaggtcg tcgataacct tacgctgggg caaattctgc tcaatccgcg 

9421 aggcgtgctg aatcaatttc agggcgccgc gcgcgacctg gccgatgagc tggaatatct 

9481 caacggaaaa accgcccgcg ccaatgccgg cgccgggttg gcggtaagta ttccagggca 

9541 aacgctttcc gtggcgttta tcgccaaagg ttatgcgcat gggcgcgtga gttcgtccat 

9601 tgatcagaac gatattcagt atctgcgcga tatccagcat gatgaacgtg tcgcgcttcg 

9661 tgaagcagga cgcgctgctt tgctggggtc tgacgaaatt acaaagcatt taaactctac 

9721 cgcgtcgggg cgggtagcga ttgtatctga ctacggtatt gcgctggcga agcagtttgt 

9781 ggtgggcgag gtgcctgttt ccattggcgt tacgccaaaa ctgcaaaaaa cctggctcta 

9841 taactacacc acctcaattt ataactacga cagtagcgac tggaatagca gccgttatcg 

9901 caatgacgat acgggtttta acattgatgc ggggcttgct gccgatatcg gtgaaaactg 

9961 gacgctgggg gtgagcggac aaaatctggt atctcgcgat atcgatacga aagatattta 

10021 catcactaat ggtatgacgg gagaaaccac caactacaaa gatacatacc aaatccgccc 

10081 cctggttacc gcgggcatcg catggcataa cgatttgctg accgtaagcg ccgatggcga 

10141 tctgacggaa acgaaaggct ttaagagcga agacaattca cagtacgttg gcgtcggcgc 

10201 tgaagttcga ccgttgtcat ggctggcggt gcgcgcaggg tatcgtgcgg atgtgaaaaa 

10261 taacgatagc aatgtggtta ccggcggtct tggctttgcg ccctttaacc gtgtgcatct 

10321 cgatttaatg ggactgtacg gcgaagatga aacctggggg gcaggcgcgc agcttacaat 

10381 gacattctga ggtgctttca ccggaggcgc tacgcctccg gtttagcgtt atgctatagt 

10441 aacgcccctt ttttaaccgg atgcttattt attcgccatg cagcctgaaa cccagtcctc 

10501 cgctttacct gcttatcgct tctccatcgc gcctatgctc gactggacgg acagacattg 

10561 ccgctatttc ctgcgtttgc tgtctcgcca gacgcagctc tacaccgaaa tggtgaccac 

10621 gggcgcgatt attcacggta agggtgacta tctggcttat agcgaagaag agcatccggt 

10681 cgctctacag cttggtggaa gcgatccggc tcagcttgcg cattgtgcaa agctggcgga 

10741 agcgcgtggc tacgatgaaa ttaacctcaa cgtggggtgc ccttccgatc gcgtgcaaaa 

10801 cggtatgttt ggcgcctgtt tgatgggcaa tgcgcaactg gtcgccgatt gtgttaaagc 

10861 catgcgtgat gtcgtctcga ttccggtgac ggtaaaaacc cgcattggta ttgacgatca 

10921 ggacagctat gcgtttctgt gtgatttcat cgatacggtt tccggtcagg gcgaatgcga 

10981 gatgtttatt atccatgcgc gcaaagcctg gctttctggc ttaagcccga aagaaaatcg 

11041 tgagatcccg ccgctggatt acccgcgcgt ctatcagcta aagcgggatt ttccgcacct 

11101 gaccatgtcc attaacggcg gcatcaaatc gctggaagag gcgaaggagc acctgcgcca 

11161 tatggatggc gtcatggttg gccgcgaagc ttatcagaat ccgggtatac tggccgcggt 

11221 ggatcgggag atttttggcg ccgataccac cgatgccgac ccggttgcgg tggttcgcgc 

11281 gatgtatccc tatattgagc gtgaattgag ccagggagcg tatctggggc atatcactcg 

11341 ccatatgctg gggttgttcc agggcatccc cggcgcgcgg cagtggcgtc gctatctgag 

11401 cgaaaacgcc cataaagctg gcgcggatgt tgctgtactg gagcaggcgc taaaactggt 

11461 ggcagacaag cgttaaaagt tcgccaaaaa ttagtcaatc tcaccacgcc ctgtgcagtc 

11521 ttgcagggcg tttttcttat atatcaataa tataaatatt ggcatgattt ttgtaagggc 

11581 ttacctgacc agacccggta agcgccgtgc cgccgggcaa tgccattttt atggggagcg 

11641 actatgctgg aactactttt tgtgcttggc ttttttctga tgttaatggt gacgggcgtc 

11701 tccttgctgg gcattctggc cgcgctggtt gtagcgactg ccgtcatgtt cctgggcgga 

11761 atgttcgccc tgatgatcaa gctgttaccg tggctactgc tggcggtggc tgtggtgtgg 

11821 gtgatcaaag cagtaaaaac gccaaaaatc ccacagtatc agcgcaataa ccgtcggttt 

11881 tactaaggta ttgagcggtt cgtcacaacc tggaactttg ccctcacaag caaataggaa 

11941 tcgattatta aatctgtcat gattgcgcag gtaacgaatt catcgcgctg taccctacat 

12001 acagccgaac aaaaaaagaa agggcttccc gggtggaagc ccaatttctt tgcagagcta 

12061 ttacggaatc agcaggcttg agccctgtgt ggcccggctt tccagcactt catgcgcgcg 

12121 acgggcatct tttaacgcat agcgttgatt ttcagccaca tcgactttaa tcacgccgct 

12181 ggcgatcaat gagaacaatt cattgctggc ttcggtcagt tcttcacgtg tcgtaatata 

12241 cccctgtagt gaaggtcgcg tggcatacag ggaacctttc tgattcagaa tacctaaatt 

12301 cacgccggtg acggggccgg acgcattgcc gaaactgacc atcagtcccc gacgttgcag 
12361 gcagtccagt gaggcctccc aggtatcttt

12421 tttgccgccg gtgatctctt ttacccgttc

12481 ctgccaggca ccggcgtcca gcgcccgctg 

12541 aagcttcgcg cccagcgctt ttgcccattg 

12601 ggcatgaaac aggaagggtt cgtcgggttt 

12661 aacggtcaat cccttgagaa aagaggctgc 

12721 aatcgcggct ttatctgcgg tgacgttatg 

12781 cgcgtagacg acgcgatcgc cgacgcgaat 

12841 cacacccgca gcttcggttc ccaggcccgc 

12901 acggatatag gtgtcgatga agttgatacc 

12961 ttccgccggt tccgctggcg taaactccac 

13021 gtgaaattca atacgcgttg ccatgcttcc 

13081 aatcactatc tcggtaactc cattcactat 

13141 gactgatgcc cgcgaccgcg atccgcaggt 

13201 tgaagcggaa cagtcggtgt tgggcggttt 

13261 ggccgagcgc gtggtggcgg aagatttcta 

13321 gatggggcgc ttgcaggaaa gcggcagtcc 

13381 ggagcggcag ggccaactgg acagcgtcgg 

13441 aaacacgcca agcgcggcga atatcagtgc 

13501 ggtccgcgat atgatcgcgg tggcgcatga 

13561 gcgcaatagc gacgaactgc tggatctggc 

13621 ccgggccaac aaagacgaag gtccgaaaag 

13681 gcgtattgag cagttgttcc agcaaccgca 

13741 tcaggatctc aataaaaaga cggcagggtt 

13801 gcgtccctcc atgggtaaaa ccacttttgc

13861 gcaagataag ccggtactga tctttagtct 

13921 tatgctggcc tcgctgtccc gcgtcgatca 

13981 tgaagactgg gcgcgaatct ccggcacgat 

14041 tattgacgac tcctcaggtc ttacgccgac

14101 ccgcgaacat ggcgggttaa gtctgattat 

14161 gtcgctttct gataaccgta ctctggaaat 

14221 ggcgaaggaa ctccaggtgc cggtcgtggc 

14281 acgcgcggat aaacgtccgg tgaactccga 

14341 tgccgactta attatgttta tctaccgtga 

14401 aggcattgct gaaattatta ttggtaagca 

14461 gacgttcaat ggtcagtggt cgcgcttcga 

14521 gtaactctcc gtcattcttt taacaaggaa 

14581 accgccgcgc tctgcgacac aacctgcaac 

14641 tggttgcggt ggtgaaagcg aacgcttatg 

14701 tccctgatgc tgacgctttt ggcgtggcgc 

14761 gcgggatcac gcagccaatc ctgctgctgg 

14821 ccatttccgc gcaatgtctg cataccgccg 

14881 aggcggtgga gctggcggag ccggtaacgg 

14941 gtctcggcgt gcgtcccgaa gaggcggagg 

15001 atgtacgcca gccggtgaat atcgtcagcc 

15061 gcgctaccga acatcagctc gacattttta 

15121 gctctattgc cgcgtctggc ggtatcctgc 

15181 gtccgggcat cattttgtat ggcgtatcgc 

15241 ttggttttca gccggtgatg tccttaacct 

15301 cgggcgaacc ggtgggctac ggcgggacat 

15361 tggtggcgat gggttatggc gatggctacc 

15421 tggtcaatgg tcgtgaagtt ccgattgtcg 

15481 atttggggcc aaacgcgcag gataacgcgg 

15541 tgccggttga acgtatcgct gaaatgacaa 

15601 tgacctcaag ggtggcgatg aagtatattg 

15661 gcgcttatcc ggcttgtatc gcgcattggc 

15721 tccggtaata tgcgcatcca ccacaacatc 

15781 gtgttatttc ctgccctctg taaacctgga 

15841 ctatgccggc gatccgattc tttcactgat 

15901 caaagtgaat ctgagcattg gcctgtatta 

15961 aacggtggcc gaagccgaag cccggcttaa 

16021 gccgatggaa ggactcaata cttatcgcca 

16081 tcacccggtt cttcagcaac agcgcgtggc 

16141 gctgaaagta ggcgcggatt tcctgaagcg

16201 cgaccccacc tgggaaaacc atattgcgat 

16261 ttacccctgg tatgacgacg cgactaacgg 

16321 gaatacgtta cctgcgcgca gtatcgtgtt 

16381 ggcggattta acgccttcgc aatgggatgc 

16441 gattccgttt cttgatattg cctatcaggg 

16501 cgttattcgc gccattgcca gcgctgggtt 

16561 gattttctcg ctgtacggcg agcgcgtcgg 
ccccacggag tcatagacca cgcggacttt
gacaatgctc tcctcacggt aattaattac 
cgctttttgc gcgctaccga cggtaccgat 
gcaggcgatc agaccgacgc cgcccgcagc 
cacttcatag gttttgcgca acaggtaaaa 
cgcctgttcg aaagaaatgg cgtcaggcaa 
gacggaactg taagcgccga gcgttgactg 
gtgctccacg ccgttgccga ctttactgac 
aggcaacgac gggggcggat agagtccgct 
aatggctttg ttctcaacct ggatttcgtg 
ggtctgaagc acttccggac caccatgctt 
tccaaaagaa atgtggtaat ctttcgaccc 
ggcaggaaat aaacccttca acaaaccaca 
tgccgggata aaagtgccgc cgcactcgat 
aatgctggat aacgagcgct gggacgatgt 
tacccgcccg catcgccata tctttacgga 
tatcgacctg attacgctcg cggaatcgct 
cggtttcgcc tatctggcgg agctgtctaa 
ttatgcggac attgtgcgcg aacgcgccgt 
aattgcggac gccggttacg atccacaggg 
ggagtcgcgc gtcttccaga tcgcggaaaa 
catcgaccag attctcgacg ccaccgtggc 
cgatggcgtt acaggcgtgg ataccggcta 
acagcgttcg gatttgatta tcgtcgcggc 
gatgaacctc tgcgaaaatg cggcgatgtt 
ggagatgccc ggcgaacaga tcatgatgcg 
gacacgtatt cgtaccggtc aacttgatga 
gggcattctg ctggagaaac gcaatatgta 
agaagtccgt tcgcgcgcgc ggcgtatttt 
gatcgactac ctgcaattga tgcgtgtgcc 
cgccgaaatt tctcgctcgt tgaaagcgct 
gctatcgcag cttaaccgct ccctggaaca 
cctgcgtgaa tccggctcta ttgaacagga 
tgaggtttat cacgagaaca gcgacttaaa 
acgtaacggt cctatcggta cggttcgtct 
taactatgcg ggaccgcaat acgatgatga 
ttcaaatgca agcggcaaca gtcgtcatta 
gtctgcgtga actggcgcct gccagtaagc 
gacacggtct tctggagacc gcgcgaacgc 
gtcttgaaga ggctctacgt ctgcgagcgg 
agggtttttt cgacgccgcc gatttgccga 
tacataatca agagcagctt gccgccctgg 
tctggatgaa gctggatacc ggtatgcatc 
cgttctacca gcgtctgacg cactgtaaaa 
attttgcccg tgcggatgag ccggaatgcg 
atgccttctg tcagggtaaa cccggtcagc 
tgtggccgca gtctcacttt gactgggcgc 
cgctggagca caaaccctgg gggccggatt 
ccagtttgat cgcggtgcgt gaccacaaag 
gggtgagtga gcgcgacacg cgcctgggcg 
cacgagcggc gccttccggt acgccagtac 
ggcgggtggc gatggatatg atttgcgtag 
gcgatccggt ggtcttatgg ggtgaaggtc 
aagtaagtgc ttacgaactt atcacgcgcc 
attaaatacg cggcgggccg ggtggcgctt 
tgtaggccgg ataagacgtt cacgccgtca 
ctctcgatct tctcgtcctt ccggtttatt 
gaaccatcgc gtgtttcaaa aagttgacgc 
ggagcgtttt aaagacgact cccgtcacga 
caacgaagac gggattatcc cacagcttaa 
cgcgcagccg catggcgctt cgctgtacct 
tactatcgcg cctttgctct ttggcgccga 
cactatccag acattaggcg gctccggcgc 
ttatttcccc gacgcaggcg tatgggtgag 
atttgccggg gcaggattcg aagtgagtac 
catccgtttt aacgatctgc tggccacgct 
gctgcacccc tgttgtcaca acccgaccgg 
ggtgattgaa atagtgaaag cgcgcgatct 
gtttggcgca ggcatggacg atgatgcgta 
acctgcgtta gtcagtaatt ctttttcgaa 
cggcctgtcc gtggtgtgtg aagatgctga 
16621

16681

16741

16801

16861

16921

16981

17041

17101

17161

17221

17281

17341

17401

17461

17521

17581

17641

17701

17761

17821

17881

17941

18001

18061

18121

18181

18241

18301

18361

18421

18481

18541

18601

18661

18721

18781

18841

18901

18961

19021

19081

19141

19201

19261

19321

19381

19441

19501

19561

19621

19681

19741

19801

19861

19921

19981

20041

20101

20161

20221

20281

20341

20401

20461

20521

20581

20641

20701

20761

20821



aatcgccgcg 

gccgtgtttc

ctggctggcg 

gaaggagctg 

tatgttcagc 

cgtttacctg 

ccgcgtggcg 

gccctccggc 

taatccttcc 

atcttaataa 

gctgagtgcc 

cccttcgacg 

ttgggtctct 

ctttgatatt 

ctactccccg 

cggctgggat 

tcgccgtggc 

gtcgaaaacg 

cgccggcgat 

tatcttctac 

tattcgcatc 

tggcgaggaa 

tttttgcatg 

cacactattc

attacactga

ttgtctggct 

gcctcgctga 

ttcctgaaaa 

atgccttcgc

gggcgtttac

ggtccacgta 

tgcaatactg 

cgcaaattaa 

cagcctctct 

tacgaccgag 

tgccggattc

tgtcggaaga 

gccagggcga 

tgcgcattat

tcgtcgtcgt 

gactcgcgga 

gcaatcgccg 

tcgttaatag

gccctgtatg

gtaagtaagt 

cagcagcgcc 

ctgaaaaaat 

tattcgcgcc 

aaaagccacg 

tagtgtgatt 

catgtcagag 

atgaggaaaa 

ataggtttaa

ccggcgacga 

tccgccgttt 

cgcaactgat 

ggctcgtcga 

ttcacgcgct 

gtcagcccta 

aactcacgcg 

ttaatctcca 

atatccggca 

cgaccgccgc 

ggcacgccag 

gaacgcgggg 

agcccctgaa 

tgggcgatgg 

gtgatacagg 

cgcgcgccgg 

tcaattttgc 

gcttccagcg



cgggttctgg 

ggcgctcagg 

gaagtcgacg 

aaggcggaga 

tataccgggt 

attgccagtg 

aaggcatttg 

ctgacaatag 

ttttgaccgt 

ttataatatt 

gtctgtttgt 

ctcaatccgg 

gtcgcccaga 

gacgacaccg 

gacagcgacg 

gagttcagta 

gacagtattt 

ctggcggata 

aaaccggaac 

ggcgattcgg 

ctgcgcgcgg 

gtcattgtca 

atttttgcac 

aggagacgtt 

gcgagaagcc 

taccgcctgt 

cgttaaacga

ccgttcccga 

atattaagtc 

agttaggaac 

aaatcatcgc 

catggcaaaa 

agtggaagat 

caaaaccagc 

tcggcacctt

acaactctat 

taaacgcaag 

tatcaatgag 

tggcgcgtag 

ccagccagcc 

tagcgtcgtc 

aggacgctac 

acgccatttt 

gctaatttcc

ggttgcttac 

agattatcga 

atctggcgga 

gcgtcctgca 

gcgctaaagc

gcgtagggca

cgaggcggca

atgatgccgc

ggaagcgggc

gaatttcgcc

taatgacgtc

gcagaacgtc

gaatatacag 

gggcctcgcc 

cgtccatcag 

cttcttcaat 

gcgtttcccg 

gaaagtgcat 

gcacgttgaa 

caaaaagctc 

tgcgcccgat 

tatcccgata 

ggaacagcgt 

taaacagccc 

tgagtttgag 

gtttaccgct

ggccttccgc



gacagctaaa 
tggtcgctac
cgatgcgtaa
tgcctgaccg 
taagcgagga 
gccgcatgtg 
ccgctgtcat 
ctgttgtgat 
tactggcgtt 
ttgaatttta 
tgtttacgct 
gcactaatgt 
ttgaaaacag 
tgttattttc 
attatctgaa 
ttccaaaaga
actttgtcac
acttccatat
aaaatacgaa 
ataacgatat 
ctaactccac 
actcggagta
aaacttgccc 
tgtttttagg 
gcgcgggttt 
cgaaacgggt
aaactgcgat
taatgcggcg 
gtcagtattg 
atggcaaggt 
gacactacaa 
cctggcgcag 
gtacttttcg 
ccggaactgg 
aataaagcgc
tatctggtcg
cagttgccgc
gcggatgacc
cctggcgagg
gcgagcatgt 
gctgtcttct 
agggtgttcg 
catctccctg 
agacctgatt
gtcgcataac
gacggtgttt
tatcgcggtc
ggaagcgggt 
ggtcaggcag 
acttcatggt 
cgcaaatcac 
ttcccggcga
ggtatgcgac
gccgccgctg 
caggttgtgt 
aagcaactgc 
cgtctgcccg 
gccggaaagc
cgtctgcagc 
ggtcatatcc 
gttatagcgc
ttcgacttta
gctgaatcgc
gcgaaccggg 
tgggctctgg 
cggcgccggt 
gtcgttaatc 
taccggtagc
cactttttct 
catgtactgg 
caccacctcg 



agcgacggtg
ggtcctgggc
ccgcattata
caactttgat
gcaggtcgat 
tgtcgccggg 
gtaatattca 
catcctcttt 
atggtcagat 
agggaaaacc 
aaaccattcc 
tgcaaaactc
cctcaccgga 
cagccccgga 
aaatccggcc 
agtggcccga 
tggtcgtagt 
tccggcggcg 
agtacagtgg 
taccgcggcc 
ctacaaaccg 
ctaaaaatca 
cgctggcttt 
aaggagcagc 
catctgataa 
ttgctgcatc
cccaccgtgc 
tatgaacatg 
ggcgtctcgt 
atctggttgg 
ggggaatgat 
agcagagcgt
cgatggtaaa
cagagttgtt
actggagcac
atgcctctta
agtcctgatt 
gtagaccgga 
tcccgctgtc 
tcctgaagcg
cttgcctgct
gcctgggcgg
agtgcagcgg
attttgctgt
ccgcgcataa 
atccccacct 
atccttgagc 
tgagcgcgct
cgtacaggcg 
gttttccata 
aaatgctcaa 
atcggaaagc
gcctcgcact 
ccgccttccg 
tcgatcacca 
tgaatatcgg
gtgccgcgct 
gttgtcgccg 
ttacgcgcca 
agcacttcgt
ttgcctttgc
atgacgccat
cctggcgtgt
gtaaagacgc
tcgatatcga
tcggcgatag
agcgtcgatt
gtaagggtca 
ggatttgccg 
ccggtcagcg 
ccgccgtgaa 



cgccgaattt 
gatgaggcgt
tcgatgcgcc 
tacttgttac 
cggttgcgcg 
cttaatgctt
ggccggacgg
tttcgtaaca
agttttttga 
atgaaaaaaa 
gcgaatgcgt 
gcggaacaag 
cgcccgccga 
ttctggcgcg 
ttttgggaaa 
cagctcattg 
cagacgaaaa 
aatatgaatc 
ctgcaggaga
cgtgattgtg 
ctgccgcagg 
ggagagcgac
taccttttcc 
ctatgtggta 
cagatgaaat 
tgttgttgct 
gcgctgatat
attacgaagg
tgctgctgcc 
gagagcaccg 
aatgaccatt 
ccacagcgac 
agaagttgaa
acgtcagcag
ggtctatctg
tcaacaggcg
atagcgcgcc 
tacggctgcc 
cttccagcag 
cgcgggcgat
tatacgtctg
tggttatcag
gcgccccaat 
tggtatctct 
agacgcaaat
tgacgggcga
agtgtctctt 
tgtaatagcg 
gataaaaaca 
acagaccgtt
cacggataca
ggcgtgaagg 
ccgccacggt 
ggccgaggtc 
cgatggtgtt 
caaagtgcag 
tcgacagttc 
actgaccaag 
gcgccggaac 
ggatggtctt 
actggtcgca
cgccctggca 
agccgcgcga 
ccgtataggt 
tcactttatc
tcgccccgtt
taccggaacc 
catctttcag 
gcacgcgttg
attccggtac
cgcccgcgcc 



actccagtcc 
taaaagcggg 
agacgctggt 
agcagcgcgg
atgagtttgg 
caaatgtaca 
cgataacagt 
aaaccgcaga
gcaaagactc 
taaccctggc 
tagtatcttc 
cgcctgtcca 
tggcggtcgg
gaaagaaaac 
aaatgaacaa 
atatgcatgt
cggaaaccgt
cggtgatttt
agaacatgcg
gtatccgcgg
caggagcgtt
aggctctcct 
agacttgctg
tcagcgcacc 
taccgataaa 
acatacctcg 
ggagcgccat
cgcggatgac
tgtacgccag 
tattcacggc
tcggaattac
tggaaagcga
ggacgcccgg 
cacagcgatg
gacggctcgc 
gtaaatgcgc
gaccaacaac 
gagggcgatt
aataatcata 



aaacgcggcg 

gataaactcg 

gtaatgtatt 

gcagacactc 

cagcctggat

agtgattttt 

cccgctcagc 

tttgccacgg

ttagttgttg 

ggcacagcag 

tgcgctatca

atgaggtggg 

ttatttgagc 

ttccggcgta 

gacaatccag 

gccctgatcg 

gccggtggtt 

gcgcgccagc 

acggatatag

cgcatcaaag 

gcctttgtac

cggcacgtaa

cgcttcgcag 

gcgagactcc 

cgccgggttg 

gaaatgttcc 

taactgacgc

cgagacgccg 

gttgttgccg 

tttcggcact

cgccataatg 

ggggccaata
20881
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 
23521 
23581 
23641 
23701 
23761 
23821 



tcaatcacat 

ttgccaagat 

agaccgatgg 

ctcgccagac 

gtgagatagt 

actttttcgg 

gcatggccaa 

cgcgcttcgc 

ttcgccagct 

tgcagcacgc 

ttaaattcaa 

aggcttgccc 

tgaaagtagt 

tcgggattct 

gtcgggcagg 

ctgtagccgc 

tcgtccatat 

tccgccaggc 

gtatgtttct 

atacgggcgc 

ttacgctctt

acgttatcga 

cgcggctcgc 

cccaccgtag 

agcccctcaa 

gcggagagcg 

gaggatttgc 

aggttaatat 

ctttcccggt 

attatgacac 

atgcaacaat 

tacgtggaaa 

attatggcca 

gaagtacgct 

tcctggcgcg 

atgttcggca

atcgaaggtc 

actgaaatta 

ggcggcgcac 

cagccgcagg 

ccagcgccgt 

gtaaaaaact 

gtgtaaacca 

aagcgcatat 

cattatccag 

gagaaatctc 

ataaaataac 

ttgtgatgag 

attgttactt 

attctgattt 



ggtcggcggc 

tgcgcagatg 

acggctcatc 

gaatacgctg 

tcaggccgac 

cgattttcgc 

tgctcatatc 

gattcagtcg 

cttcgcgcac 

cttcgaacgg 

tgttctcttt 

acggcgcatc 

aaaaattgcg

ggatcacgcg 

cgcctgccgg 

aaatcggaca 

cggcaacaac 

gttgggaaag 

tttgtagctc 

gaatgtaacc 

taataatcgg 

ccatctggct 

ccacgcgggc 

agcgcgggtt 

tatggtcgac 

attcaacgta

ctgaacccga 

ttttgagatt 

agagactcgg 

aatttaacct

atgaggttcg 

tggtacactc 

gcagaggcgt 

atatgccgag 

ataagcagac 

aactggcgga 

aattgcgtac

acgttccgca 

cggcaggcgg 

gcggcaacca 

ctaacgaacc 

caaagcgtta

ggcattttct 

cttacggaaa

ccgaagttca 

atctgcaaaa 

ggtagagaat 

ctcaaatatg 

atagctggct 

tttactcccc 



gcgaatggcg 

aatcagcgta 

cagcacatac 

ggcttcgccg

gttcaccaga 

tcgttgcccg

agaaatagca 

cgtgccttca 

cgcgctggat 

atggcggcgc 

gccggaaccg 

tacgtcgaac 

acgatcccaa 

gtccggatcg 

gttgttgaac 

ggcaaaattg

cgccgtgccg 

atcgttgcga

cagcttcggc 

ctggcttgcc 

cgcgagcagc 

aacggtttgc

aaacagcagg 

gtgcgatgtc 

atccggtttt 

acgacgctgc 

aagcccggtc 

atgggtgcgg 

atgcctggtt 

gtttgaatat 

cgcgagtgac 

acgcgtttac 

aaacaaggtg 

tggcggcgct 

cggcgaaatg 

agtggccggc 

ccgcaagtgg 

gatcggcggc 

tcagcagcag 

gttcagcggc

gccaatggat 

tttgcatttt 

tacctggtat 

aatgtcgctt

taaatgtact 

tatataattt 

ggtcggtatt

atatgtgtat 

ttatataaaa 

acttattata



tcttcatcgt 

cccagcagcc 

atcacgccga 

ccggaaagcg 

aacttgaggc 

gaaagcttga 

ggcagcggcg 

cagctggcgc 

tccgtctctt 

acggaggtat 

tacagcacga 

ttatagtgtt 

ccacgaatcg 

aaatattgct 

gagaacagac 

gcggagaaca 

ccggataatt

actttgaagc 

ggatcggaga 

agattttcca

atcagacgtt

gccgccagcg 

cgcaggtagt 

gatttctgtt 

tccatgagcg 

ccttcggcat 

acgacaatca

gcgccccgaa 

tgtttgaagg 

acagtattgg

ctggaacctg 

actatttatg 

attctcgttg 

gtcgccaact 

aaagagcaga 

gaatatctgc

accgatcaga 

gtgatgcaga

gggggttggg 

ggcgcgcagt 

tttgacgacg

cgctatagtt 

tattgtcttt 

atcgcctctg 

gcaataaccc 

atagccattt

atcaatggtt

cttgctttat

aatggtt tta 

ttttcaatga 



gttccaccac 
gttcgttatc 
ctaacccggc 
tctctgccga 
gatcgccgat
gattagtgaa 
tattttccac 
aggggcgatt
tataacggcg
cgccgcgatc 
ctttatgtac 
ccgccagcga 
cgccgcctgc 
gaacgccgag
gcggttccag 
gaagctcctc 
ccagcgccgt 
gatcgatcac 
gatcgcagac
gcgttttggt 
tgccttccgg 
gcacgtcatg
cgtggatctc 
caattgagat 
acaaaaactg 
acagagtgtc 
gtttgtcgcg 
cttcgatctt 
acaaacggca
aatgcatctt
catcgcagct
aaacgtattc
gtaatctggg 
taacgctggc 
ctgaatggca 
gtaaaggttc
gtggccagga 
tgctgggtgg 
gtcagccgca 
cgcgtccaca 
acattccgtt
ctcgtctgct 
gtatcccttt 
gccgactcgc 
ggattgtctt 
tttgacaaca
aattaaatgt 
ttttaattgc 
tttgtgtatt 
tttaagttat 



aatcacggta 
gcgctggtgc 
gcctatctgg
gcgggagagc
ctcttttagc 
aaaatccatc 
aaatacatgg 
actgatgaac 
ctccatatta 
gttcatatat 
gttggcgctg 
cttgagcatt 
cagcgacagg
gccgtcacag 
ctcgcgcata 
cgctttctca 
ttcgaacgac
cacctcaatg
ttcgccgtca
gtgttcgcct 
ctgtgacagc 
atccggacaa
ggtaatagta
cgcgggcgat
ccgcgcgtaa 
gaaagccagt 
ggggatgacg 
atccattcac 
gaaacggcta
cagctaaagt
attaaaatgc
aggagactca 
ccaggacccg 
tacttctgaa
ccgggtggtg 
tcaggtgtat 
acgctatacg
tcgccagggc 
acaacctcag 
gcaatccgcg 
ctgatttttg 
gaaatgcctg 
caaaaaagag 
ctcggcacat 
aaatatgaaa 
aaaagatatt
ttgctttagc
tgaagataaa 
tttttacaca 
attgaagtcc 